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Abstract. The recent observations of galaxy and dark matter clumpy distributions have provided new 
elements to the understanding of the problem of cosmological structure formation. The strong clumping 
characterizing galaxy structures seems to be present in the overall mass distribution and its relation 
to the highly isotropic Cosmic Microwave Background Radiation represents a fundamental problem. The 
extension of structures, the formation of power-law correlations characterizing the strongly clustered regime 
and the relation between dark and visible matter are the key problems both from an observational and 
a theoretical point of view. We discuss recent progresses in the studies of structure formation by using 
concepts and methods of statistical physics. 
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1 Introduction 

In contemporary cosmological models the structures ob- 
■ served today at large scales in the distribution of galaxies 
in the universe (see Fig.l — discovered by the projects, 
e.g., 2dF PQ, SDSS [213] 1 are explained by the dynamical 
evolution of purely self-gravitating matter (dark matter) 
from an initial state with low amplitude density fluctu- 
ations, the latter strongly constrained by satellite obser- 
vations of the fluctuations in the temperature of the cos- 
, mic microwave background radiation (e.g. the satellites 
COBE0] and WMAP0). Despite the apparent simplicity 
of the scheme, fundamental theoretical problems remain 
open and the overall picture is based on the assumption 
that the main mass component is dark. 

In this theoretical framework one crucial element is 
represented by the initial conditions (IC) of the matter 
density field. Models of the early universe [6] predict cer- 
tain primordial fluctuations in the matter density field, 
defining their correlation properties and their relation to 
the present day matter distribution. When gravity start 
to dominate the dynamical evolution of density fluctu- 
ations, which can generally be described by the Vlasov 
or "collision-less Boltzmann" equations coupled with the 
Poisson equation, perturbations are still of very low am- 
plitude. One of the most basic results (see e.g., [7]) about 
self-gravitating systems, treated using perturbative ap- 
proaches to the problem (i.e. the fluid limit), is that the 
amplitude of small fluctuations grows monotonically in 
time, in a way which is independent of the scale. This lin- 
earized treatment breaks down at any given scale when 
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Fig. 1. Latest progress in redshift surveys. SDSS Great Wall 
(2003) compared to CfA2 (1986) Great Wall at the same scale. 
Redshift distances cz are indicated. The small circle at the bot- 
tom has a diameter of 5 Mpc/ft, the clustering length according 
to the standard interpretation of galaxy correlation. The SDSS 
slice is 4 degrees wide, the CfA2 slice is 12 degrees wide to make 
both slices approximately the same physical width at the two 
walls. (From [8]). 



the relative fluctuation at the same scale becomes of or- 
der unity, signaling the onset of the "non-linear" phase of 
gravitational collapse of the mass in regions of the corre- 
sponding size. If the initial velocity dispersion of particles 
is small, non-linear structures start to develop at small 
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scales first and then the evolution becomes "hierarchical" , 
i.e., structures build up at successively larger scales. Given 
the finite time from the IC to the present day, the devel- 
opment of non-linear structures is limited in space, i.e., 
they can not be more extended than the scale at which 
the linear approach predicts that the density contrast be- 
comes of order unity at the present time. This scale is 
fixed by the initial amplitude of fluctuations, constrained 
by the CMBR, by the hypothesized nature of the dominat- 
ing dark matter component and its correlation properties. 

Observations of large scale galaxy distributions pro- 
vide important tests for these models. On the one hand 
the first question concerns the extension of the regime of 
non-linear clustering and the intrinsic properties of galaxy 
structures. On the other hand according to this scenario, 
at some large scales where fluctuations are still of small 
amplitude, the imprints of primordial correlations should 
be preserved and their detection represents a key obser- 
vation for the validation of the model. 

In order to approach this complex problem, we use 
methods and concepts of modern statistical physics [5] to 
make a bridge between the primordial fluctuation field and 
the development of large scale structure in the universe. 
The first issue we discuss in what follows concerns the cor- 
relation properties of the observed distribution of galaxies 
and galaxy clusters, approaching this problem with the 
perspective of a statistical physicist, exposed to the de- 
velopments of the last decades in the description of in- 
trinsically irregular structures, and by using instruments 
suitable to describe strong irregularity, even if limited to 
a finite range of scales |9|10|llj . These methods offer a 
wider framework in which to approach the problem of 
how to characterize the correlations in galaxy distribu- 
tions, without the a priori assumption of homogeneity 
That is, without the assumption that the distribution in- 
side a given sample is already uniform enough to give to a 
sufficiently good approximation, the true (non-zero) mean 
density of the underlying distribution of galaxies. While 
this is a simple and evident step for a statistical physicist, 
it can seem to be a radical one for a cosmologist. After 
all the whole theoretical framework of cosmology (i.e., the 
Friedmann-Robertson- Walker - FRW - solutions of gen- 
eral relativity) is built on the assumption of an homoge- 
neous and isotropic distribution of matter. The approach 
we propose is thus an empirical one, which surely is ap- 
propriate when faced with the characterization of data. 
Further it is evidently important for the formulation of 
theoretical explanations to understand and characterize 
the data. 

The second question in which the use of methods and 
concepts of statistical physics allow us to clarify an impor- 
tant issue, concerns the correlation properties of the initial 
matter density fields in standard cosmological models. In 
these models the matter density field is described as hav- 
ing small fluctuations about a well defined mean density 
and the initial conditions (i.e., very early in the history 
of the universe) are specified by the so-called Harrison- 
Zeldovich condition. It is here that the concept of "super- 
homogeneity" introduced, for example, in the studies of 



plasma and glass distributions, is relevant, as these mod- 
els describe fluctuations which are in fact of this type. 
Standard type models are indeed characterized by surface 
quadratic fluctuations (of the mass in spheres) and, for the 
particular form of primordial cosmological spectra, by a 
negative power-law in the reduced correlation function at 
large separations [12|13j . The clarification of these prop- 
erties, which correspond to a global fine-tuning of positive 
and negative correlations, allow us to define the strategy 
to measure such signals in real galaxy samples and to iden- 
tify several problems concerning, for example, the effects 
related to sampling (galaxy distribution can be regarded 
as a sampling of the underlying dark matter density field) . 

The third issue in which a statistical physics approach 
maybe useful concerns the theoretical modeling of non lin- 
ear structure formation. Analytical solutions of the Poisson- 
Vlasov equations are very difficult to be formulated and 
the only instrument beyond the linear regime is repre- 
sented by numerical simulations. N-body simulations solve 
numerically for the evolution of a system of N particles in- 
teracting purely through gravity, with a softening at very 
small scales. The number of particles N in the very largest 
current simulations [14j is ~ 10 10 , many more than two 
decades ago, but still many orders of magnitude fewer than 
the number of real dark matter particles (~ 10 80 in a 
comparable volume for a typical candidate). While such 
simulations constitute a very powerful and essential tool, 
they lack the valuable guidance which a fuller analytic un- 
derstanding of the problem would provide. The question 
inevitably arises of the extent to which such numerical 
simulations of a finite number of particles, reproduce the 
mean- field/ Vlasov limit of the cosmological models. The 
theoretical questions concerns the validity of this collision- 
less limit and thus the crucial point is represented by the 
analysis of the "discreteness effects" |15I16I17I18) . 

As already mentioned, although dark matter is sup- 
posed to provide with more than 0.9 of the total fraction 
of the mass-energy in universe (see e.g. |19j). its amount 
and properties can only be defined a posteriori. In addi- 
tion the relation of dark matter to visible matter is still 
not clear and the distribution itself of visible matter re- 
quires more observations to be understood on the relevant 
scales (see e.g. |20|10j ). More than twenty years ago it has 
been surprisingly discovered that galaxy velocity rotation 
curves remain flat at large distances from the galaxy cen- 
ter while the density profile of luminous matters rapidly 
decays (see e.g. [22 )■ This is one of the strongest indica- 
tions of the need from dynamically dominant dark matter 
in the universe. Most attention has been focused on the 
fact that these bound gravitational systems contain large 
quantities of unseen matter and an intricate paradigm has 
been developed in which non-baryonic dark matter plays 
a central role not only in accounting for the dynamical 
mass of galaxies and galaxy clusters but also for provid- 
ing the initial seeds which have given rise to the formation 
of structure via gravitational collapse [7] . In current stan- 
dard cosmological models, various forms of dark matter 
are needed to explain a number of different phenomena, 
while baryons, which can be detected in the form of, for ex- 
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ample, luminous objects such as stars and galaxies, would 
only be the 5% of the total mass in the universe; the rest 
is made of entities about which very little is understood: 
dark matter and dark energy. Very recently there have 
been developed observational techniques which, by mea- 
suring the effect of gravitational lensing in galaxy clusters 
[23; , or by measuring the gravitational influence of struc- 
tures on the CMBR [Mj , are able to reconstruct the three- 
dimensional distribution of dark matter and thus allow 
a comprehension of the relative distribution of luminous 
and dark matters, whose theoretical modeling is still lack- 
ing. These observations have lead to surprising discoveries 
which rise new and crucial questions to the validity of the 
standard interpretation of structure formation [25] , 

2 Initial conditions and super-homogeneity 

The most prominent feature of the IC in the early uni- 
verse, in standard theoretical models, derived from infla- 
tionary mechanisms, is that matter density field presents 
on large scale super- homogeneous features [12] . This means 
the following. If one considers the paradigm of uniform dis- 
tributions, the Poisson process where particles are placed 
completely randomly in space, the mass fluctuations in a 
sphere of radius R growths as R 3 , i.e., like the volume 
of the sphere. A super-homogeneous distribution is a sys- 
tem where the average density is well defined (i.e., it is 
uniform) and where fluctuations in a sphere grow slower 
than in the Poisson case, e.g., like R 2 : in this case there 
are the so-called surface fluctuations to differentiate them 
from Poisson- like volume fluctuations. 

A well known system in statistical physics systems of 
this kind is the one component plasma [13] (OCP) which 
is characterized by a dynamics which at thermal equilib- 
rium gives rise to such configurations. The OCP is simply 
a system of charged point particles interacting through 
a repulsive 1/r potential, in a uniform background which 
gives overall charge neutrality. Simple modifications of the 
OCP can produce equilibrium correlations of the kind as- 
sumed in the cosmological context |13] . 

In terms of the normalized mass variance u 2 (R) = 
(M(R) 2 ) - (M(R)) 2 /(M(R)) 2 , where (M(R)) is the av- 
erage mass in a sphere of radius R and (M(R) 2 ) is the 
average of the square mass in the same volume. Thus for 
a Poisson distribution, where there are no correlation be- 
tween particles (or density fluctuations) at all, one sim- 
ply has cr 2 {R) ~ R~ 3 . For an ordered system character- 
ized by small-scale anti-correlation the variance behaves 
as <J 2 (R) ~ i?~ 4 which is the fastest possible decay for 
discrete or continuous distributions [12]. 

The reason for this peculiar behavior of primordial 
density fluctuations is the following. In a FRW cosmol- 
ogy there is a fundamental characteristic length scale, the 
horizon scale i?#(£). It is simply the distance light can 
travel from the Big Bang singularity t = until any given 
time t in the evolution of the Universe, and it grows lin- 
early with time. The Harrison- Zeldovich (H-Z) criterion 
can be written as cr^(i? = Rnit)) = constant. This con- 
ditions states that the mass variance at the horizon scale 



is constant: this can be expressed more conveniently in 
terms of the power spectrum of density fluctuations |12j 
P(k) = (|(5 p (k)| 2 ) where <5 p (k) is the Fourier Transform of 
the normalized fluctuation field (p(r)—po)/po, being po the 
average density. It is possible to show that H-Z criterion 
is equivalent to assume P(k) ~ k: in this situation mat- 
ter distribution present fluctuations of super- homogeneous 
type given [T2"] . 

The H-Z condition is a consistency constraint in the 
framework of FRW cosmology. In fact the FRW is a cosmo- 
logical solution for a homogeneous Universe, about which 
fluctuations represent an inhomogeneous perturbation: if 
density fluctuations obey to a different condition than 
P(k) ~ k, then the FRW description will always break 
down in the past or future, as the amplitude of the pertur- 
bations become arbitrarily large or small. For this reason 
the super-homogeneous nature of primordial density field 
is a fundamental property independently on the nature of 
dark matter. This is a very strong condition to impose, 
and it excludes even Poisson processes (P{k) = constant 
for small k) p] . 

Various models of primordial density fields differ for 
the behavior of the power spectrum at large wave-lengths, 
i.e., at relatively small scales [5]. However at small k they 
both exhibit the H-Z tail P(k) ~ k which is in fact the 
common feature of all density fluctuations compatible with 
FRW models. Thus theoretical models of primordial mat- 
ter density fields in the expanding universe are character- 
ized by a single well-defined length scale, which is an im- 
print of the physics of the early universe at the time of the 
decoupling between matter and radiation [6] . The redshift 
characterizing the decoupling is directly related to the 
scale at which the change of slope of the power-spectrum 
of matter density fluctuations P(k) occurs, i.e., it defines 
the wave-number k c at which there is the turnover of 
the power-spectrum between a regime, at large enough 
k, where it behaves as a negative power-law of the wave 
number P(k) ~ k m with —1 < m < —3, and a regime at 
small k where P(k) ~ k as predicted by inflationary the- 
ories. Given the generality of this prediction, it is clearly 
extremely important to look for this scale in the data. As 
mentioned in the introduction the range of length-scales 
corresponding to the regime of small fluctuations is lin- 
early amplified during the growth of gravitational insta- 
bilities. According to current models the scales at which 
non-linear clustering occurs at the present time (of order 
10 Mpc) are much smaller than the scale r c , corresponding 
to the wave-number k c , which is predicted to be r c « 124 
Mpc/h (where 0.5 < h < 1 is the normalized Hubble pa- 
rameter) from arguments based on CMBR anisotropies 
[19] . Thus the region where the super-homogeneous fea- 
tures should still be in the linear regime, allowing a direct 
test of the IC predicted by early universe models. 

At the scale r c the real space correlation function £ (r) 
(Fourier transform of the power spectrum) crosses zero, 
becoming negative at larger scales. In particular the cor- 
relation function presents a positive power-law behavior at 
scales r <C r c and a negative power-law behavior (£(r) ~ 
— r -4 ) at scales r 3> r c . Positive and negative correla- 
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tions are exactly balanced in way such that the integral 
over the whole space of the correlation function is equal 
to zero. This is a global condition on the system fluctua- 
tions which corresponds to the fact that the distribution 
is super-homogeneous. 



By considering the observational features of super ho- 
mogeneity one has to take into account that in standard 
models galaxies result from a sampling of the underlying 
dark matter density field: for instance one selects (ob- 
servationally) only the highest fluctuations of the field 
which would represent the locations where galaxy will 
eventually form. It has been shown that sampling a super- 
homogeneous fluctuation field changes the nature of cor- 
relations [26j . introducing a stochastic noise which makes 
the system substantially Poisson (e.g. P(k) ~ constant) 
at large scales. However one may show that the negative 
£(r) ~ r~ 4 tail does not change under sampling: on large 
enough scales, where in these models (anti) correlations 
are small enough, the biased fluctuation field has a corre- 
lation function which is linearly amplified with respect to 
the underlying dark matter correlation function. For this 
reason the detection of such a negative tail would be the 
main confirmation of the super-homogeneous character of 
primordial density field [5]. 



The scale r c marks the maximum extension of pos- 
itively correlated structures: beyond r c the distribution 
must be anti-correlated since the beginning, as there was 
no time to develop other correlations. The presence of 
structures, which mark long-range correlations, whether 
or not of large amplitude, reported both by observations 
of galaxy distributions (as those shown in Fig.l) and by 
the indirect detection of dark matter [23124) is already 
pointing toward the fact that positive correlations extend 
well beyond r c . For example, in [24] it is shown that deep 
counts of radio-galaxies present a dip of about 20 — 45% 
in the surface brightness at the location of a cold spot 
observed in the CMBR anisotropies by the WMAP satel- 
lite. It is then argued that if the cold spot does originate 
from structures at modest redshift, to create, by gravita- 
tional interaction (the integrated Sachs- Wolfe effect), the 
magnitude and angular size of the WMAP cold spot it is 
required a ~ 140 Mpc radius completely empty void. This 
result, if confirmed, shows that (i) there are large-scale 
structures of all matter (dark and visible) extended well 
beyond the possible prediction of current models and that 
(ii) these structures are of very large amplitude. This re- 
sult must be tested in by the analysis of three-dimensional 
galaxy catalogs. Up to now, measurements of large sam- 
ples of galaxy redshifts are not extended enough to reach 
this region, where it is expected that £(r) ~ — r~ 4 , with 
the appropriate and robust statistical properties. Future 
surveys, like the complete SDSS catalog [2], may sample 
this range of scales, but a precise study of the crossover to 
homogeneity, discretization effects, sampling effects and 
statistical noise is still required. 



3 Large scale galaxy distribution 

In the past twenty years observations have provided sev- 
eral three dimensional maps of galaxy distribution, from 
which there is a growing evidence of existence large scale 
structures. This important discovery has been possible 
thanks to the advent of large redshift surveys: angular 
galaxy catalogs, considered in the past, are in fact es- 
sentially smooth and structure-less. In the CfA2 catalog 
(1990) [27], which was one of the first maps surveying the 
local universe, it has been surprisingly observed the giant 
"Great Wall" , a filament linking several groups and clus- 
ters of galaxies of extension of about 200 Mpc/h. Recently 
the SDSS project |2] (2004—2009) has allowed to discover 
the "Sloan Great Wall" which is almost double longer than 
the Great Wall. Nowadays this is the most extended struc- 
ture ever observed, covering about 400 Mpc/h, and whose 
size is again limited by the boundaries of the sample. The 
search for the "maximum" size of galaxy structures and 
voids, beyond which the distribution becomes essentially 
smooth is still one of main open problems. Instead it is 
well established that galaxy structures are strongly irreg- 
ular and form complex patterns. 

The first question in this context concerns the studies 
of galaxy correlation properties. Two-point properties are 
particularly useful to determine correlations and their spa- 
tial extension. There are different ways of measuring two- 
point properties and, in general, the most suitable method 
depends on the type of correlation, strong or weak, charac- 
terizing a given point distribution in a sample. The earliest 
observational studies, from angular catalogs, produced the 
primary result [28) that the reduced two-point correlation 
function £(r) = ^"^"1°^ — 1 (where n is the density of 

points) is well approximated, in the range of scales from 
about 0.1 Mpc/h to 10 Mpc/h, by a simple power-law: 
£(r) » (r/r )~ 7 with 7 w 1.8 and r sa 4.7 Mpc/h. This 
result was subsequently confirmed by numerous other au- 
thors in different redshift surveys (see e.g., [2§]). However, 
while £(r) shows consistently a simple power- law behav- 
ior characterized by this exponent, there is very consider- 
able variation among samples, with different depths and 
luminosity cuts, in the measured amplitude of £(r). This 
variation is usually ascribed a posteriori to an intrinsic 
difference in the correlation properties of galaxies of dif- 
ferent luminosity (see e.g., [53]): brighter galaxies present 
larger values of rg. Theoretically it is interpreted as a real 
physical phenomenon, as a manifestation of "biasing" |30J. 

Such a variation of the amplitude of the measured cor- 
relation function may, however, be explained, entirely or 
partially, as a finite-size effect i.e., as an artifact of statis- 
tical analysis in finite samples. The explanation is as fol- 
lows (see [8]): The reduced correlation function £(r) can 

be written as £(r) = ^ n & p —1, where (n(r)) p is the condi- 
tional density of points, i.e., the mean density of points in a 
spherical shell of radius r centered on a galaxy. The latter 
is generally a very stable local quantity, the reliable esti- 
mation of which at a given scale r requires only a sample 
large enough to allow a reasonable number of independent 
estimates of the density in a shell. The mean density (n) , 
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on the other hand, is a global quantity. The size of a sam- 
ple in which it is estimated reliably is not known a priori, 
but depends on the properties of the underlying distribu- 
tion. Specifically the sample must be large enough so that 
the mean density estimated in it has a sufficiently small 
fluctuation with respect to the true asymptotic average 
density. 

It has been pointed out [31] that, when analyzing a 
point distribution which, like the galaxy distribution, is 
characterized by large fluctuations, one should, in fact, 
first establish the existence of a well defined mean den- 
sity (and ultimately the scale at which it becomes well 
defined and independent of the sample size, if it does) 
before a statistic like £(r), which measures fluctuations 
with respect to such a mean density, is employed. Further 
the existence of power-law correlations, which are clearly 
present in the galaxy distribution, is typical of fractal dis- 
tributions, which are asymptotically empty. In such dis- 
tributions the mean density is always strongly sample de- 
pendent, with an average value decreasing as a function 
of sample size. Given the observation of such correlations 
in the system, and the instability of the amplitude of the 
correlation function £(r) estimated in different samples, 
special care should be taken in establishing first the scale 
(if any) at which homogeneity becomes a good approx- 
imation. The simplest way to do this is in fact to mea- 
sure the conditional density (n(r)) . These quantities are 
generally well defined, and give a characterization of the 
two-point correlation properties of the distribution, irre- 
spective of whether the underlying distribution has a well 
defined mean density or not. A simple power law behavior 
(n(r)) p — Br 1 is characteristic of scale-invariant fractal 
distributions, with the exponent 7 < 3 giving the fractal 
dimension through D = 3 — 7. The pre-factor B is, in this 
case, simply related to the lower cut-off of the distribution 
[8] . If the distribution has a well defined mean density, one 
has, asymptotically, (n(r)) = constant > (i.e., D = 3 in 
the previous formula) . Measurements of this quantity can 
thus both characterize (i) the regime of strong clustering 
and (ii) the scale and nature of a transition to homogene- 
ity. Only once the existence of an average density within 
the sample size is established in this manner does it make 
sense to use £(r). 

Results in past catalogs (see [5] and references therein) 
and in preliminary samples of the SDSS [20111] show (Fig. 2) 
that in the range of scales [0.5,~ 30] Mpc/h galaxy dis- 
tributions are characterized by power-law correlations in 
the conditional density in rcdshift space, with an exponent 
7 = 1.0 ± 0.1. In the range of scales [~ 30,- 100] Mpc/h 
there are evidences for systematic unaveraged fluctuations 
corresponding to the presence of large scale structures ex- 
tending up to the boundaries of the present survey, which 
require a detailed analysis of the problems induced by fi- 
nite volume effects on the determination of the conditional 
density. In addition there are evidences which suggest that 
in such range of scales the power-law index of the condi- 
tional density has a smaller value. However future surveys 
will allow to distinguish between the two possibilities: that 
a crossover to homogeneity (corresponding to 7 = in 
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Fig. 2. Behavior of the conditional density (red dots) in a 
preliminary sample of the SDSS survey [20], together with the 
determination of the conditional density (blue dots) in a sample 
of the CfA2 catalog reported in [21]. There is a substantial 
agreement between the two catalogs and that the new SDSS 
data seem to show a flattening at about 70 Mpc/h. A more 
detailed analysis is required to study this transition and to 
characterize possible finite size effects which may affect this 
behavior. (From [10] ) 

the conditional density) occurs before 100 Mpc/h, or that 
correlations extend to scales of order 100 Mpc/h (with a 
smaller exponent < 7 < 1). 

Finally we note that even if a transition toward a con- 
stant value of the conditional density will be finally de- 
tected this does not imply that the distribution becomes 
uncorrelated on larger scales. In fact, this means that 
structures, beyond the crossover scale, have small ampli- 
tude but they can be very well correlated on larger scales. 
It is then in this situation where the detection of anti- 
correlations, which as discussed above are predicted by all 
models of primordial density fields, become the relevant 
issue to be addressed. 



4 Gravitational many-body problem 

The understanding of the thermodynamics and dynamics 
of systems of particles interacting only through their mu- 
tual Newtonian self gravity is of fundamental importance 
in cosmology and astrophysics. In statistical physics the 
problem of the evolution of self gravitating classical bod- 
ies has been relatively neglected, primarily because of the 
intrinsic difficulties associated with the attractive long- 
range nature of gravity and its singular behavior at van- 
ishing separation. Long-range interacting systems (LRIS) 
present a series of peculiar properties which make them 
qualitatively different from systems in which the interac- 
tions between the component elements are short-range. In 
the case of LRIS every element is coupled to every other 
element in the system and not only with those located in a 
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finite neighborhood around itself. For this reason some of 
the most basic concepts and instruments in physics, e.g. 
the framework of equilibrium statistical mechanics, which 
have been developed for short-range interacting systems, 
cannot be extended to treat LRIS. One of the main feature 
of these systems is that thermodynamical equilibrium is 
not generally reached. 

Gravity is the paradigmatic example of LRIS and the 
peculiar features of self gravitating systems have been 
mainly considered in the context of astrophysics and cos- 
mology. More recently |32j primarily through the study 
of various simplified toy models, it has been shown that 
LRIS generally exhibit a whole set of new qualitative prop- 
erties and behaviors: ensemble in-equivalence (negative 
specific heat, temperature jumps), long-time relaxation 
(quasi-stationary states), violations of ergodicity, subtleties 
in the relation of the fluid (i.e., continuum) picture and 
the particle (granular) picture, etc.. These are commons 
to other physical laboratory systems such as systems with 
unscreened Coulomb interactions and wave-particle sys- 
tems relevant to plasma physics [32] . 

With the aim of approaching the problem of gravita- 
tional clustering in the context of statistical mechanics it 
is natural to start by reducing as much as possible the 
complexity of the analogous cosmological problem and to 
focus on the essential aspects of the problem. Thus we 
consider clustering without the expansion of the universe, 
and starting from particularly simple initial conditions. 
Our recent results suggest that in simplifying we do not 
loose any essential elements which change the nature of 
gravitational clustering |15|16|17|18j . 

The problem of the evolution of self gravitating classi- 
cal bodies, initially distributed very uniformly in infinite 
space, is as old as Newton. Modern cosmology poses es- 
sentially the same problem as the matter in the universe 
is now believed to consist predominantly of almost purely 
self-gravitating particles which is, at early times, indeed 
very close to uniformly distributed in the universe, and at 
densities at which quantum effects are completely negli- 
gible. Despite the age of the problem and the impressive 
advances of modern cosmology in recent years, our un- 
derstanding of it remains, however, very incomplete. In 
its essentials it is a simple well posed problem of classical 
statistical mechanics. 



4.1 Discreteness effects in the linear regime 



We have recently formulated [15|16j a perturbative theory 
of the discrete N body problem which represents an use- 
ful approach to control the problem of discreteness even 
in cosmological simulations in the regime of small fluctu- 
ations, i.e., in the linear regime (see Fig. 3). This situation 
is obtained by using as initial conditions of the problem 
an infinite lattice of particles slightly displaced with small 
or zero initial velocity dispersion. Thus up to a change in 
sign in the force, the initial configuration is identical to the 
Coulomb lattice (or Wigner crystal) in solid state physics 
(see e.g. [33]), and we exploit this analogy to develop an 



approximation to the evolution, in the linear regime, of 
the gravitational problem. 

More specifically, the equation of motion of particles 
moving under their mutual self-gravity is |34j 



E 



Gmimj(x-i — Xj) 



(1) 



Here dots denote derivatives with respect to time t, is 
position of the ith particle of mass m^. We treat a system 
of N point particles, of equal mass m, initially placed on 
a Bravais lattice, with periodic boundary conditions. Per- 
turbations from the Coulomb lattice are described simply 



by Eq. |TJ with and Gr 



-e (where e is the elec- 



tronic charge). As written in Eq. (fTJ the infinite sum giv- 
ing the force on a particle is not explicitly well defined. 
It is calculated by solving the Poisson equation for the 
potential, with the mean mass density subtracted in the 
source term. In the cosmological case this is appropriate 
as the effect of the mean density is absorbed in the Hubble 
expansion; in the case of the Coulomb lattice and of the 
gravitational static case (which we consider here) it corre- 
sponds to the assumed presence of an oppositely charged 
(negative mass for gravity) neutralizing background (see 
discussion in [35]). 

We consider now perturbations about the perfect lat- 
tice. It is convenient to adopt the notation Xj(i) = R + 
u(R, i) where R is the lattice vector of the ith particle, 
and u(R, t) is the displacement of the particle from R. Ex- 
panding to linear order in u(R, t) about the equilibrium 
lattice configuration (in which the force on each particle 
is exactly zero), we obtain 



ii(R,i) 



R' 



D(R-R')u(R',t) 



(2) 



The matrix T> is known in solid state physics, for any in- 
teraction, as the dynamical matrix (see e.g. [33]). It is pos- 
sible to compute the Fourier transform of T>: diagonalizing 
it one can determine, for each k, three orthonormal eigen- 
vectors e„(k) and their eigenvalues w^(k) (n — 1,2,3), 
which obey [33J the Kohn sum rule Yln^nO*) ~ — 47rGpo, 
where po is the mean mass density. 

At this point one may solve Eq|2] by standard tech- 
niques, obtaining that <5(k) ~ exp(A/47rGe„(k)t) where 
<5(k) is the Fourier mode k of the density contrast S(r) = 
(p(r) - p )/po and e„(k) = - %£qJ ■ The eigenvalues are 
represented in Fig. 3 (right panel) for the case of a simple 
cubic lattice: we note that this particular case presents 
both oscillating modes (e n (k) < 0) and modes which grow 
faster (e n (k) > 1) than in the fluid limit (which corre- 
sponds to e„(k) = 1 Vk). 

In the limit that the initial perturbations are restricted 
to wavelengths much larger than the lattice spacing, the 
evolution corresponds exactly to that derived from an 
analogous linearization of the dynamics of a pressure-less 
self-gravitating fluid. Our less restricted approximation al- 
lows one to trace the evolution of the fully discrete distri- 
bution until the time when particles approach one another, 
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Fig. 3. Initial condition for a N-body simulation corresponding 
to a perturbed lattice (left). In this situation density pertur- 
bations are small and a linear analysis of the discrete problem 
allows one to identify a spectrum of eigen-values (right) cor- 
responding to different time scales of collapse for the various 
wave-length of the perturbations. In the fluid limit the time 
scale is the same for all modes and, in these units, equal to 
one. (From [15]). 

with modifications of the fluid limit explicitly depending 
on the lattice spacing. Thus one can understand exhaus- 
tively the modifications introduced, at a given time and 
length scale, by the finiteness of N. 

4.2 Toward the understanding of non-linear regime 

In an infinite space, in which the initial fluctuations are 
non-zero and finite at all scales, the collapse of larger and 
larger scales will continue ad infinitum. The system can 
therefore never reach a time independent state, thus never 
reaching a thermodynamic equilibrium. One of the impor- 
tant results from numerical simulations of such systems in 
the context of cosmology is that the system nevertheless 
reaches a kind of scaling regime, in which the temporal 
evolution is equivalent to a rescaling of the spatial vari- 
ables [M] . This spatio-temporal scaling relation is referred 
to as "self-similarity". 

The evolution from above mentioned shuffled lattice 
(SL) initial conditions converges, after a sufficient time, to 
a "self-similar" behavior, in which the two-point correla- 
tion function obeys a simple spatio-temporal scaling rela- 
tion. The time dependence of the scaling is in good agree- 
ment with that inferred from the linearized fluid approx- 
imation. Between the time at which the first non-linear 
correlations emerge in a given SL and the convergence to 
this "self-similar" behavior, there is a transient period of 
significant duration. During this time, the two-point corre- 
lation function already approximates well, at the observed 
non-linear scales, a spatio-temporal scaling relation, but 
in which the temporal evolution is faster than the asymp- 
totic evolution. This behavior can be understood as an 
effect of discreteness, which leads to an initial "lag" of the 
temporal evolution at small scales. The non-linear corre- 
lations when they first develop are very well accounted 
for solely in terms of two-body correlations. This is nat- 
urally explained in terms of the central role of nearest 
neighbor (NN) interaction in the build-up of these first 
non- linear correlations [35]. This two-body phase extends 
to the time of onset of the spatio-temporal scaling, and 



thus the asymptotic form of the correlation function is 
already established to a good approximation at this time. 

This situation has lead us to consider the comparison 
of the evolution of such a system and that of "daugh- 
ter" coarse-grained (CG) particle distributions [TB] (see 
Fig. 4). These are sparser (i.e., lower density) particle dis- 
tributions, defined by a simple coarse-graining procedure, 
which share the same large-scale mass fluctuations. In 
the numerical simulations the CG particle distributions 
are observed to evolve to give, after a sufficient time, 
two-point correlation properties which agree well, over 
the range of scales simulated, with those in the origi- 
nal distribution. Indeed both the original system and its 
coarse-grainings converge toward a simple dynamical scal- 
ing ( "self-similar" ) behavior with the same amplitude. The 
characteristic time required for the CG system to begin to 
reproduce the clustering in the original particle distribu- 
tion at scales below the CG scale increases as the latter 
scale does. These observations are all very much in line 
with the qualitative picture of the evolution of clustering 
widely accepted in cosmology: the CG distributions share 
the same fluctuations at large scales and it is these initial 
fluctuations alone, to a very good approximation, which 
determine the correlations which develop at smaller scales 
at later times. 

As discussed above once particles begin to fall on one 
another there is a phase in which very significant non- 
linear correlations develop due to interactions between 
NN pairs of particles. The form of the two-point corre- 
lation function which develops in this phase is very sim- 
ilar to that observed, in the same range of amplitude, in 
the asymptotic scaling regime at later times [35]. Thus 
it appears that it is always possible to choose a CG of 
the original system, which reproduces quite well the non- 
linear correlations in the original system with this "early 
time" , explicitly discrete, dynamics of "macro-particles" 
of the CG distribution. This provides a simple physical 
picture/dynamical model for the generation of the non- 
linear correlation function in the relevant range 

This finding is very different to any existing explana- 
tions of the dynamics giving rise to non-linear correlations 
in N body simulations in cosmology. In this context the- 
oretical modeling invariably assumes that the non-linear 
correlations observed in simulations in this range should 
be understood in the framework of a continuum Vlasov 
limit, in which a mean-field approximation of the grav- 
itational field is appropriate. Indeed the fact that self- 
similarity is observed, with a behavior independent of the 
particle density, is usually taken as an indication that such 
a continuum description is appropriate. Our model is man- 
ifestly not of this type, a key element is the discrete NN 
dynamics, while also consistent with the amplitudes of the 
correlation function being independent of particle density. 



5 Conclusions 

The recent observations of galaxy and dark matter com- 
plex clumpy distributions have provided new elements for 
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Fig. 4. Upper panels: Same initial conditions representing a 
randomly perturbed lattice, with different number of points. 
Bottom panels: gravitationally evolved systems. Despite the 
fact that the lower resolution simulation has much less points, 
it traces the same structures of the higher resolution one. The 
identification of the similarities and differences among these 
systems allows one to understand the effects related to the 
finiteness of the number of points in the simulations. (From 

nsi). 



the understanding of the problem of cosmological struc- 
ture formation. The strong dumpiness characterizing galaxy 
structures seems to be present in the overall mass dis- 
tribution and its relation to the highly isotropic CMBR 
represents a fundamental problem. In contemporary cos- 
mological models the structures observed today at large 
scales in the distribution of galaxies are explained by the 
dynamical evolution of purely self-gravitating matter from 
an initial state with low amplitude density fluctuations. 
The extension of structures, the formation of power-law 
correlations characterizing the strongly clustered regime 
and the relation between dark and visible matter are the 
key problems both from an observational and a theoretical 
point of view. 

In this puzzle statistical physics plays an important 
role in various ways, which we have discussed above: (i) 
The complete characterization of the correlations of vis- 
ible and dark matter, (ii) The analysis of the very small 
anisotropies of the CMBR and their implications on the 
initial fluctuations which recall the super-homogeneous 
properties similar to plasmas and glasses, (iii) The dynam- 
ical processes and theories for the formation of complex 
structures from a very smooth initial distribution and in 
a relatively short time. 
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